
Calculating 
Group-By 
selectivity S 
(See Figure 2 
and Figure 3) 



Calculating 
Group-By 
result size: 
R = I x S 
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Let 
T= 1 
U= 1 
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For each table i 
in query, DO 
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For each column c in 
the Group-By list, DO 
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Get number of values Tc 
(number of rows in table) 



Get degree of uniqueness 
Sc (from statistics) 



Calculate number of distinct 
values Uc = Sc x Tc 
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Set 
U = UxUc 




Group-By selectivity 
S = U/T 
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Last table in query? 
Yes 
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Figure 2 



For each table i 
in query, DO 
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Get table size Ti 
Y 



Set Tmax = MAX (Tmax, Ti) 




Add equivalent columns 
to Group-By list 

V 



For each column c in 
the Group-By list, DO 
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Get number of values Tc 
(number of rows in table) 

I 
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Get degree of uniqueness 
Sc (from statistics) 

i 
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Normalize degree of 

uniqueness: 

Sen = Sc x Tc / Tmax 



Set 

S = S + Sen - (S x Sen) 




Let 
S = Sen 
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Figure 3 
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